Finding Frequent Patterns in Parallel Point Processes

نویسندگان

Christian Borgelt

David Picado-Muiño

چکیده

We consider the task of finding frequent patterns in parallel point processes—also known as finding frequent parallel episodes in event sequences. This task can be seen as a generalization of frequent item set mining: the co-occurrence of items (or events) in transactions is replaced by their (imprecise) co-occurrence on a continuous (time) scale, meaning that they occur in a limited (time) span from each other. We define the support of an item set in this setting based on a maximum independent set approach allowing for efficient computation. Furthermore, we show how the enumeration and test of candidate sets can be made efficient by properly reducing the event sequences and exploiting perfect extension pruning. Finally, we study how the resulting frequent item sets/event sets can be filtered for closed and maximal sets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mining Frequent Parallel Episodes with Selective Participation

We consider the task of finding frequent parallel episodes in parallel point processes, allowing for imprecise synchrony of the events constituting occurrences (temporal imprecision) as well as incomplete occurrences (selective participation). We tackle this problem with frequent pattern mining based on the CoCoNAD methodology, which is designed to take care of temporal imprecision. To cope wit...

متن کامل

A New Viewpoint for Mining Frequent Patterns

According to the traditional viewpoint of Data mining, transactions are accumulated over a long period of time (in years) in order to find out the frequent patterns associated with a given threshold of support, and then they are applied to practice of business as important experience for the next business processes. From the point of view, many algorithms have been proposed to exploit frequent ...

متن کامل

Mining Frequent Patterns in Uncertain and Relational Data Streams using the Landmark Windows

Todays, in many modern applications, we search for frequent and repeating patterns in the analyzed data sets. In this search, we look for patterns that frequently appear in data set and mark them as frequent patterns to enable users to make decisions based on these discoveries. Most algorithms presented in the context of data stream mining and frequent pattern detection, work either on uncertai...

متن کامل

An Efficient Range Partitioning Method for Finding Frequent Patterns from Huge Database

Data mining is finding increasing acceptance in science and business areas that need to analyze large amounts of data to discover trends that they could not otherwise find. Different applications may require different data mining techniques. The kinds of knowledge that could be discovered from a database are categorized into association rules mining, sequential patterns mining, classification, ...

متن کامل

Parallel Association Rule Mining with Minimum Inter-Processor Communication

Existing parallel association rule mining algorithms suffer from many problems when mining massive transactional datasets. One major problem is that most of the parallel algorithms for a shared nothing environment are Aprioribased algorithms. Apriori-based algorithms are proven to be not scalable due to many reasons, mainly: (1) the repetitive I/O disk scans, (2) the huge computation and commun...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2013

Finding Frequent Patterns in Parallel Point Processes

نویسندگان

چکیده

منابع مشابه

Mining Frequent Parallel Episodes with Selective Participation

A New Viewpoint for Mining Frequent Patterns

Mining Frequent Patterns in Uncertain and Relational Data Streams using the Landmark Windows

An Efficient Range Partitioning Method for Finding Frequent Patterns from Huge Database

Parallel Association Rule Mining with Minimum Inter-Processor Communication

عنوان ژورنال:

اشتراک گذاری